AITopics | pricing policy

StatefulPostedPricingwithVanishingRegretvia DynamicDeterministicMarkovDecisionProcesses

Neural Information Processing SystemsFeb-7-2026, 17:55:08 GMT

A very useful method for devising online posted prices is viavanishing-regret online learning algorithms inanadversarial environment [12,11,23,9,10,28].

artificial intelligence, machine learning, online, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > Canada > Quebec > Montreal (0.05)
Asia > Middle East > Israel > Haifa District > Haifa (0.05)
(8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.67)

Add feedback

1673a54332b2afc905722048c26f5a4c-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 15:35:35 GMT

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

1673a54332b2afc905722048c26f5a4c-Paper-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 15:35:31 GMT

We propose a randomized dynamic pricing policy based on a variant of the Online Newton Stepalgorithm (ONS)thatachievesaO(d T log(T))regretguarantee underan adversarial arrival model.

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Beyond Demand Estimation: Consumer Surplus Evaluation via Cumulative Propensity Weights

Bian, Zeyu, Biggs, Max, Gao, Ruijiang, Qi, Zhengling

arXiv.org Machine LearningJan-6-2026

This paper develops a practical framework for using observational data to audit the consumer surplus effects of AI-driven decisions, specifically in targeted pricing and algorithmic lending. Traditional approaches first estimate demand functions and then integrate to compute consumer surplus, but these methods can be challenging to implement in practice due to model misspecification in parametric demand forms and the large data requirements and slow convergence of flexible nonparametric or machine learning approaches. Instead, we exploit the randomness inherent in modern algorithmic pricing, arising from the need to balance exploration and exploitation, and introduce an estimator that avoids explicit estimation and numerical integration of the demand function. Each observed purchase outcome at a randomized price is an unbiased estimate of demand and by carefully reweighting purchase outcomes using novel cumulative propensity weights (CPW), we are able to reconstruct the integral. Building on this idea, we introduce a doubly robust variant named the augmented cumulative propensity weighting (ACPW) estimator that only requires one of either the demand model or the historical pricing policy distribution to be correctly specified. Furthermore, this approach facilitates the use of flexible machine learning methods for estimating consumer surplus, since it achieves fast convergence rates by incorporating an estimate of demand, even when the machine learning estimate has slower convergence rates. Neither of these estimators is a standard application of off-policy evaluation techniques as the target estimand, consumer surplus, is unobserved. To address fairness, we extend this framework to an inequality-aware surplus measure, allowing regulators and firms to quantify the profit-equity trade-off. Finally, we validate our methods through comprehensive numerical studies.

artificial intelligence, estimator, machine learning, (16 more...)

arXiv.org Machine Learning

2601.01029

Country:

North America > United States > California (0.14)
North America > United States > Virginia (0.04)
North America > United States > Texas > Reagan County (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)

Add feedback

67e235e7f2fa8800d8375409b566e6b6-Paper.pdf

Neural Information Processing SystemsNov-14-2025, 04:22:45 GMT

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > California (0.14)
North America > Canada > Quebec > Montreal (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Near-Optimal Regret-Queue Length Tradeoff in Online Learning for Two-Sided Markets

Yang, Zixian, Varma, Sushil Mahavir, Ying, Lei

arXiv.org Artificial IntelligenceOct-17-2025

We study a two-sided market, wherein, price-sensitive heterogeneous customers and servers arrive and join their respective queues. A compatible customer-server pair can then be matched by the platform, at which point, they leave the system. Our objective is to design pricing and matching algorithms that maximize the platform's profit, while maintaining reasonable queue lengths. As the demand and supply curves governing the price-dependent arrival rates may not be known in practice, we design a novel online-learning-based pricing policy and establish its near-optimality. In particular, we prove a tradeoff among three performance metrics: $\tilde{O}(T^{1-γ})$ regret, $\tilde{O}(T^{γ/2})$ average queue length, and $\tilde{O}(T^γ)$ maximum queue length for $γ\in (0, 1/6]$, significantly improving over existing results [1]. Moreover, barring the permissible range of $γ$, we show that this trade-off between regret and average queue length is optimal up to logarithmic factors under a class of policies, matching the optimal one as in [2] which assumes the demand and supply curves to be known. Our proposed policy has two noteworthy features: a dynamic component that optimizes the tradeoff between low regret and small queue lengths; and a probabilistic component that resolves the tension between obtaining useful samples for fast learning and maintaining small queue lengths.

artificial intelligence, machine learning, ptq, (18 more...)

arXiv.org Artificial Intelligence

2510.14097

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.81)

Industry: Education > Educational Setting > Online (0.60)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.60)

Add feedback

PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks

Liu, Yunuo, Zhu, Dawei, Al-Khalili, Zena, Cheng, Dai, Chen, Yanjun, Klakow, Dietrich, Zhang, Wei, Shen, Xiaoyu

arXiv.org Artificial IntelligenceOct-15-2025

We present PricingLogic, the first benchmark that probes whether Large Language Models(LLMs) can reliably automate tourism-related prices when multiple, overlapping fare rules apply. Travel agencies are eager to offload this error-prone task onto AI systems; however, deploying LLMs without verified reliability could result in significant financial losses and erode customer trust. PricingLogic comprises 300 natural-language questions based on booking requests derived from 42 real-world pricing policies, spanning two levels of difficulty: (i) basic customer-type pricing and (ii)bundled-tour calculations involving interacting discounts. Evaluations of a line of LLMs reveal a steep performance drop on the harder tier,exposing systematic failures in rule interpretation and arithmetic reasoning.These results highlight that, despite their general capabilities, today's LLMs remain unreliable in revenue-critical applications without further safeguards or domain adaptation. Our code and dataset are available at https://github.com/EIT-NLP/PricingLogic.

large language model, machine learning, qwen2, (15 more...)

arXiv.org Artificial Intelligence

2510.12409

Country: